A Novel Approach to Dropped Pronoun Translation
نویسندگان
چکیده
Dropped Pronouns (DP) in which pronouns are frequently dropped in the source language but should be retained in the target language are challenge in machine translation. In response to this problem, we propose a semisupervised approach to recall possibly missing pronouns in the translation. Firstly, we build training data for DP generation in which the DPs are automatically labelled according to the alignment information from a parallel corpus. Secondly, we build a deep learning-based DP generator for input sentences in decoding when no corresponding references exist. More specifically, the generation is two-phase: (1) DP position detection, which is modeled as a sequential labelling task with recurrent neural networks; and (2) DP prediction, which employs a multilayer perceptron with rich features. Finally, we integrate the above outputs into our translation system to recall missing pronouns by both extracting rules from the DP-labelled training data and translating the DP-generated input sentences. Experimental results show that our approach achieves a significant improvement of 1.58 BLEU points in translation performance with 66% F-score for DP generation accuracy.
منابع مشابه
Dropped personal pronoun recovery in Chinese SMS
In written Chinese, personal pronouns are commonly dropped when they can be inferred from context. This practice is particularly common in informal genres like Short Message Service (SMS) messages sent via cell phones. Restoring dropped personal pronouns can be a useful preprocessing step for information extraction. Dropped personal pronoun recovery can be divided into two subtasks: (1) detecti...
متن کاملTranslating Pro-Drop Languages with Reconstruction Models
Pronouns are frequently omitted in pro-drop languages, such as Chinese, generally leading to significant challenges with respect to the production of complete translations. To date, very little attention has been paid to the dropped pronoun (DP) problem within neural machine translation (NMT). In this work, we propose a novel reconstruction-based approach to alleviating DP translation problems ...
متن کاملA Stylistic Approach to Translation: Figurative language devices in the Persian renderings of Alcott's Little Women
The present study aimed firstly at investigating the impact of translators' style on figurative language translation from English into Persian. Secondly, it intended to find which strategies were most frequently adopted by Persian translators to translate figures of speech into Persian. Lastly, the study sought to check the extent of transference of figurative features of literary texts in Engl...
متن کاملA Stylistic Approach to Translation: Figurative language devices in the Persian renderings of Alcott's Little Women
The present study aimed firstly at investigating the impact of translators' style on figurative language translation from English into Persian. Secondly, it intended to find which strategies were most frequently adopted by Persian translators to translate figures of speech into Persian. Lastly, the study sought to check the extent of transference of figurative features of literary texts in Engl...
متن کاملPronoun-Focused MT and Cross-Lingual Pronoun Prediction: Findings of the 2015 DiscoMT Shared Task on Pronoun Translation
We describe the design, the evaluation setup, and the results of the DiscoMT 2015 shared task, which included two subtasks, relevant to both the machine translation (MT) and the discourse communities: (i) pronoun-focused translation, a practical MT task, and (ii) cross-lingual pronoun prediction, a classification task that requires no specific MT expertise and is interesting as a machine learni...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016